Goto

Collaborating Authors

 global terrorism database


Predicting the Unpredictable: Reproducible BiLSTM Forecasting of Incident Counts in the Global Terrorism Database (GTD)

arXiv.org Artificial Intelligence

We study short-horizon forecasting of weekly terrorism incident counts using the Global Terrorism Database (GTD, 1970--2016). We build a reproducible pipeline with fixed time-based splits and evaluate a Bidirectional LSTM (BiLSTM) against strong classical anchors (seasonal-naive, linear/ARIMA) and a deep LSTM-Attention baseline. On the held-out test set, the BiLSTM attains RMSE 6.38, outperforming LSTM-Attention (9.19; +30.6\%) and a linear lag-regression baseline (+35.4\% RMSE gain), with parallel improvements in MAE and MAPE. Ablations varying temporal memory, training-history length, spatial grain, lookback size, and feature groups show that models trained on long historical data generalize best; a moderate lookback (20--30 weeks) provides strong context; and bidirectional encoding is critical for capturing both build-up and aftermath patterns within the window. Feature-group analysis indicates that short-horizon structure (lagged counts and rolling statistics) contributes most, with geographic and casualty features adding incremental lift. We release code, configs, and compact result tables, and provide a data/ethics statement documenting GTD licensing and research-only use. Overall, the study offers a transparent, baseline-beating reference for GTD incident forecasting.


Retool AI to forecast and limit wars

#artificialintelligence

Armed violence is on the rise and we don't know how to stop it1. Since 2011, conflicts worldwide have killed up to 100,000 people a year, three-quarters of whom were in Afghanistan, Iraq and Syria. The rate of major wars has decreased over the past few decades. But the number of civil conflicts has doubled since the 1960s, and terrorist attacks have become more frequent in the past ten years. The nature of conflict is changing.


Optimizing Stochastic Gradient Descent in Text Classification Based on Fine-Tuning Hyper-Parameters Approach. A Case Study on Automatic Classification of Global Terrorist Attacks

arXiv.org Machine Learning

The objective of this research is to enhance performance of Stochastic Gradient Descent (SGD) algorithm in text classification. In our research, we proposed using SGD learning with Grid-Search approach to fine-tuning hyper-parameters in order to enhance the performance of SGD classification. We explored different settings for representation, transformation and weighting features from the summary description of terrorist attacks incidents obtained from the Global Terrorism Database as a pre-classification step, and validated SGD learning on Support Vector Machine (SVM), Logistic Regression and Perceptron classifiers by stratified 10-K-fold cross-validation to compare the performance of different classifiers embedded in SGD algorithm. The research concludes that using a grid-search to find the hyper-parameters optimize SGD classification, not in the pre-classification settings only, but also in the performance of the classifiers in terms of accuracy and execution time.


Open Data Spotlight: The Global Terrorism Database

#artificialintelligence

Publishing data on Kaggle is a way organizations can reach a diverse audience of data scientists with an enthusiasm for learning, knowledge, and collaboration. For Dr. Erin Miller of START, the National Consortium for the Study of Terrorism and Responses to Terrorism, making her organization's Global Terrorism Database available for analysis by Kaggle users has brought new awareness to their cause. In this Open Data Spotlight, Erin discusses how setting aside agendas and focusing on understanding this unparalleled dataset of over 150,000 attack events allows users to undertake constructive analyses that may defy common conceptions about terrorism. Read on to learn more about the Global Terrorism Database project and the ways users of open data can make valuable contributions to the organizations that make them possible. My role started out (more than 12 years ago) as a graduate assistant cleaning raw data, and now I manage the project team, workflow, resources, and interaction with end users and related research projects.